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ABSTRACT 


In  many  estimation  problems,  Incomplete  as  well  as  complete 
samples  are  available  for  Bayesian  prediction.  After  develop- 
ing the  theory  for  a special,  but  useful  family  of  dlstrlbu- 
I tlons,  examples  are  given  In  life  testing,  renewal  risk  pro- 

cesses, life  contingencies,  and  the  problem  of  estimating  a 
defective  distribution. 


BAYESIANS  LEARN  WHILE  WAITING 


by 

William  S.  Jevell 


1.  INTRODUCTION 

In  Bayesian  prediction  problems,  one  is  Interested  In  using  observed 
values  of  a given  process  to  update  the  prior  knowledge  about  the  process 
parameters,  and  thence  to  make  better  predictions  about  the  process  Itself. 
Most  of  the  theory  concerns  Itself  either  with  exact  calculations  using  so- 
called  natural-conjugate  families  of  prior  and  likelihood  distributions  [1], 
or  with  best  linear  least-squares  approximations,  referred  to  in  the  actu- 
arial literature  as  credibility  theory  [3].  However,  both  approaches  con- 
sider only  the  use  of  complete  data  samples. 

The  purpose  of  this  paper  Is  to  show  that  there  are  many  situations  In 
which  inaomptete  observations  also  provide  updating  Information,  that  Is, 
Bayeslans  can  learn  while  waiting  for  the  finish  of  the  sampling  experiment. 
After  developing  the  necessary  theory  and  Introducing  the  gamma-proportional- 
hazard  family  of  distributions  most  appropriate  for  Incomplete  data  formula- 
tions, examples  are  given  from  life  testing,  renewal  risk  processes,  and  life 
contingency  reserving.  It  Is  shown  In  what  sense  an  Individual  life  (or 
cohort  of  similar  lives)  can  learn  about  his  (their)  own  remaining  lifetime 
distributions  with  the  passage  of  time.  The  paper  concludes  with  the  prob- 
lem of  estimating  the  parameters  and  the  defect  In  a defective  distribution. 
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2.  MODEL 

As  is  usual  In  Bayesian  models,  we  assume  that  x , the  random  life- 
time of  interest,  has  a likelihood  distribution  function,  P(x  | 0)  , which 
depends  upon  an  unknown  random  parameter  0 which  has  a prior  distribution 
function,  P(6)  . We  use  P^  » 1 - P to  denote  the  complementary  distribu- 
tion (or  survival)  function,  and  we  assume  that  (continuous  or  discrete) 
densities  exist,  denoted  by  p(x  ] 0)  , p(0)  etc. 

The  basic  problem  is  to  use  observational  data,  sampled  from  the 
likelihood  distribution  with  fixed,  but  unknown  parameter,  in  Bayes*  law  to 
find  the  posterior-to-data  distribution  of  the  parameter,  and  thence  to  pre- 
dict various  moments  and  economic  functions  of  the  underlying  lifetime  pro- 
cess. 

To  Illustrate  the  natural  way  in  which  Incomplete  samples  arise,  we 
consider  a life-testing  scheme  in  reliability  [4],  in  which: 

1.  N items,  all  with  lifetimes  drawn  as  samples  from  P(x  | 0)  with 
common  and  fixed  0 , are  put  "on  test"  at  epochs  {t^}  , and  re- 
moved from  test  at  epochs  (t^  + T^}  , (1,1,2,  ...,  N)  ; 

2.  C of  these  items  (with  Indices  in  the  set  C)  will  have  failed 

before  removal  with  observed  lifetimes  (x^^  ” ^ (1  e C)  ; 

3.  The  remaining  lifetimes  are  not  completely  observed,  since  the 

items  are  still  operating  at  removal,  so  it  is  known  only  that 
{\  > T^}  (1  ^ C)  . 

Depending  upon  the  experimental  protocol,  the  {T^}  may  be  fixed  in 
advance,  giving  then  a random  C ; or,  C may  be  fixed  in  advance  for  a 
simultaneous  test,  giving  a common,  random  time-on-test,  T . Considering 
for  a moment  that  the  {T^}  are  fixed,  and  denoting  the  observed  data  by 
P - (yi,72>  •••.  i C}  , where 
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(2.1) 


(1  e C) 

(i  ^ C)  ’ 


we  can  easily  argue  that  the  likelihood  density  of  this  data  set,  given 
6 , Is: 


(2.2)  p(I?  I 9)  - n p(x,  I 0)  n I 9)  . 

ieC  ^ iiC  ^ 

Bayes'  law  then  gives  the  predictive  density  for  continued  testing  of 
Items  j ^ C , or  for  future  experiments  on  other  Items  with  the  same 
parameter  value: 


(2.3) 


P(x  1 V) 


^ p(x  1 0) 

J 

£(2_L9iPi9) 

j p(P  I 0’)p(9')d0' 


d9  . 


The  ratio  In  square  brackets  Is  the  posterlor-to-data  parameter  density, 

p(0  I V)  . 

(2.2)  Is  also  useful  for  many  other  life  testing  protocols.  Suppose 
that  all  Items  are  put  on  test  at  the  same  epoch;  the  common  testing  Inter- 
val T need  not  be  fixed  In  advance,  but  may  be  a continuously-evaluated 
stopping  rule,  a possibly  random  decision  to  stop  experimenting  that 
depends  upon  the  values  {xj^,X2,  ...»  x^,  ; C}  observed  up  to  and  Including 
T , but  not  directly  upon  0 . In  this  case,  the  likelihood  includes  addi- 
tional terms  relating  to  the  stopping  rule  that  cancel  out  of  the  ratio  In 

(2.3);  the  stopping  rule  is  non-informative,  and  the  likelihood  kernel 
(2.2)  is  sufficient  for  9 . For  Instance,  one  could  stop  after  the  fifth 
failure,  or  at  T equal  to  twice  the  first-observed  complete  lifetime. 
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THE  PROPORTIONAL-HAZARD  FAMILY 

The  calculation  of  (2.3)  can,  of  course,  be  carried  out  by  computer 
for  any  given  prior  and  likelihood  distributions.  However,  for  model- 
building, it  is  desirable  to  use  a family  of  distributions  in  which  the 
calculations  are  especially  tractable  so  that  parametric  behavior  can  be 
analyzed  theoretically.  Unfortunately,  the  Koopman-Pitman-Darmois  expo- 
nential family  of  distributions  so  useful  in  credibility  theory  [2]  has  no 
simple  form  for  P 

However,  a special  case  of  the  exponential  family,  the  proportional- 
hazard  family^  [4]  has  useful  properties; 

(3.1)  P‘^(x  I 0)  = ; p(x  | 6)  - eq(x)e"®^^*^  , (x  ^ 0) 

where  Q(x)  is  a monotone  non-decreasing  function  (Q(0)  » 0)  , and 
q(x)  “ dQ(x)/dx  . We  note: 

1.  eQ(x)  is  the  cumulative  hazard  (failure)  function,  making  q(x) 
a unit-  or  prototype  failure  rate; 

2.  If  w is  a random  variable  with  absolute  failure  rate,  q(w)  , 
and  0 is  an  integer,  the  original  lifetime,  x , has  a physical 
interpretation  as 


X 


min 


3.  This  family  Includes  the  exponential,  Welbull,  and  Gumbel 
(extreme-value)  distributions. 


I 


The  data  likelihood  (2.2)  becomes: 
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(3.2)  P(I?  I e)  - n q(x  , 

ieC  ^ 

where  TQT  Is  a statistic, 

N 

(3.3)  TQT(P)  - I Q(x  ) - I Q(x  ) + I Q(T  ) , 

i-1  ^ ieC  ^ UC  ^ 

referred  to  in  [4]  as  the  total-Q-on-test-atatiatio , a generalization  of 
the  "total-tlme-on-test"  concept  of  reliability  life-testing.  Note  that 
if  item  k was  already  age  (and  still  working)  when  placed  on  test, 

then  Q(Sj^)  should  be  subtraated  from  the  TQT. 

A convenient  natural  conjugate  prior  for  6 is  the  gamma  density, 

(3.4)  p(0)  “ p(0  1 Cq,Qq)  - 

with  hyperparameters  > Qq  > bhe  usefulness  of  (3.4)  in  modelling  uni- 
modal  densities  is  well  known.  It  is  easy  to  see  that  Bayes'  law  then 
gives  a posterior-to-data  density  of  the  parameter,  p(0  1 P)  , that  is 
also  gamma,  with  updated  parameters: 

(3.5)  Cq  Cq  + C ; Qq  Qq  + TQT(P)  . 

Furthermore,  the  updated  means  of  0 and  0 ^ obey  the  exact  credibility 
formulae: 

(3.6)  [E{0  I - (1  - Z^)[E{0}]”^  + • 


^o(v)  ° 


Cn-1  -ePr 


r(c„) 


, (0  > 0) 


(3.7) 


E{r^  I P}  - (1  ~ Z2)E{e~h  + , 


with  credibility  factors: 


(3.8)  - C/(Cq  + C)  ; Z2  - C/(Cq  - 1 + C)  . 


The  po8terlor-to-data  variances  are  also  easily  obtained: 


(3.9) 


(3.10) 


1/(6  1 0} 


(s  ’=) 


[E{6  1 mr  , 


I ^ ■ (c,--Vvc)  I ■»]'  I 


the  first  terms  decrease  with  Increasing  C , and  so,  ultimately,  with 
probability  one,  do  the  variances.  This  makes  precise  the  difference  be- 
tween Incomplete  and  complete  samples;  two  different  data  sets  could  lead 
to  the  same  mean  forecast,  but  we  would  have  more  "confidence"  In  the  re- 
sult with  the  larger  number  of  complete  samples. 

The  terms  In  square  brackets  In  (3.6)  (3.7)  are  the  classical  maxlmum- 
llkellhood  estimators  got  from  the  term  In  square  brackets  In  (3.2).  If 
the  experiment  gives  a large  number  of  complete  observations,  relative  to 
Cq  , then  the  Bayesian  and  maxlmum-llkellhood  estimators  coincide.  However, 
for  relatively  Incomplete  tests,  more  weight  Is  given  the  prior  means, 

E{0}  - Cq/Qq  , or  E{0"^}  - Qq/(Cq  - 1)  . 

Classical  estimators  are  often  obtained  from  Bayesian  formulae  when  the 
prior  knowledge  becomes  "diffuse";  In  our  model  this  corresponds  to  keeping 
E{0}  or  E{0  fixed,  and  letting  the  corresponding  variances  (the  prior 
uncertainty)  Increase  without  limit.  From  (3.9)  (3.10)  we  see  this  corres- 
ponds to  letting  Cq  -*■  0 or  Cq  2 , respectively  (with  corresponding 
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Qq)  • Thus,  with  very  unoertain  prior  knowledge,  we  get; 

£{0*^  i V)  - iLC^.>.^+^TqT(P?  ^ 

Thus,  when  estimating  0 , we  place  "full  credibility"  in  the  maximum- 
likelihood  estimator,  and  ignore  all  prior  information;  but,  when  estima- 
ting 0 ^ , a Bayesian  would  always  insist  on  keeping  the  prior  mean  as  an 
initial  data  point,  because  the  Inverse  gamma  is  still  informative. 

The  mixed,  or  predictive  distribution  of  x , averaged  over  all 
possible  values  of  0 , is: 

(3.13)  I Cq,Qq)  - [qo/(Qo  + Q(x))j^°  , (x  > 0) 
with  density 

(3.1A)  p(x  I Cq,Qq)  - (CQq(x)/QQ)[^QQ/(QQ  + QCx))^®  ^ , 

a generalization  of  the  shifted  Pareto  distribution.  If  the  prototype 
failure  function  is  Gumbel,  we  get  exponential  tails  for  large  x in 

(3.13) ,  while  if  the  underlying  failures  are  Welbull,  we  get  the  "more 
dangerous"  algebraic  tails.  Posterior-to-the-data,  predictive  density  is 
of  the  same  form,  but  with  updated  parameters. 

The  cumulative  hazard  function  of  the  mixed  distribution  is: 


adjustments  in 

(3.11) 

(3.12) 


(3.15)  R(x  1 Cq.Qq)  - -In  P®(x  j C^.Q^)  - Cq  In  [1  + (Q(x)/Qg)l  . 
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One  can  show  that  this  mixing  tends  to  decrease  the  rate  of  failure;  in 
fact,  the  mixed  population  may  have  approximately  cG.*stant  or  decreasing 
hazard  rate,  even  with  increasing  q(x)  . 

Life  testing  applications  are  covered  in  more  detail  in  [4],  and  the 
problem  of  model  Identification  of  the  form  of  Q is  also  considered.  We 
turn  now  to  applications  of  these  ideas  in  risk  theory. 
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4.  RENEWAL  PROCESSES 

In  one  model  of  the  collective  risk  process,  claims  are  assumed  to 
follow  a renewal  process.  If,  during  an  exposure  Interval  T , C events 
(accidents,  claims,  equipment  failures,  etc.)  are  observed,  this  means 
there  are  C complete  Interval  samples  {x^^}  , and  the  final  Interval-ln- 
C 

progress,  T - ^ x.  . If  all  Intervals  are  sampled  from  (3.1)  with  fixed 

1-1  ^ 

6 , the  parameter  updating  becomes: 

% 

(4.1)  Cq  ^ Cq  + C ; Qq  ^ Qq  + Q(x^)  + Q^T  - x^j  . 

Note  that  not  only  the  random  number  of  events  In  (0,T]  , but  also  the 
actual  lengths  of  the  Intervals  provide  Information  In  the  general  case. 

An  Important  special  case  In  risk  processes  occurs  when  Q(x)  - x , 
leading  to  exponentially-distributed  intervals,  and  a Poisson  counting  pro- 
cess, for  each  6 . However,  here 

(^•2)  Qq  ^ ^0  ^ *1  ^ ^ *i)  * % • 

so  we  conclude  that  the  Poisson  process  Is  special  In  that  only  the  number 
of  events  in  (0,T]  , not  the  epochs  of  events,  provides  predictive  Informa- 
tion! 

If  the  claim  amount  (severity)  distribution  Is  Independent  of  6 (the 
usual  assumption),  then  the  above  Is  sufficient  to  predict  total  losses 
from  the  compound  distribution;  In  the  contrary  case,  additional  modelling 
and  updating  are  necessary. 
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5.  INDIVIDUAL  LEARNING  ABOUT  REMAINING  LIFE 

We  turn  now  to  the  Interesting  question  of  whether  or  not  a Bayesian 
can  learn  about  his  own  remaining  lifetime  distribution  function  (rldf). 

For  a mixed  population  with  average  tail  distribution 

(5.1)  Pr  {x  > T + u I X > T}  - - P^(u) 

P^(T) 

V 

represents  the  fraction  of  those  individual  components  alive  (operating)  at 
age  T which  will  survive  until  age  T + u . 

However,  for  a single  life  component  with  known  parameter  6 , the 
appropriate  rldf  is: 

(5.2)  Pr  {x  > T + u I X > T ; 6}  - t “ .1  ■ . pC(u  I 9)  , 

P‘'(T  I 0)  ^ 

If  we  have  to  estimate  this  single  life  behavior  as  averaged  over  the 
population  (i.e.,  without  Bayesian  learning),  we  get  the  prior  expected 
rldf: 


(5.3)  e{p®(u  1 0)1  - f ^ I p(0)d0  , 

^ ^ ^ P^T  I 0) 

which  is  clearly  not  Identical  with  (5.1). 

Now  let  us  adopt  the  Bayesian  point  of  view,  and  estimate  the  remain- 
ing life  of  a single  Individual  who  has  lived  to  age  T ; since  he  is  still 
alive,  we  have  the  single  datum  P > {x  > 1}  , which  must  update  the 
parameter  density  to: 


(5.4) 


p(0  1 P) 


P^T  I 0)p(0)  P*^(T  I 0)p(0)  ^ 

Jp*^^  I ♦)p(^)d^  ^ 
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So  Che  Bayeslan-updated  rldf  will  be 

r 

(5.5)  E te(»  I e)J  - f “ I.  »)  je  , 

6|p’  ^ ^ P'(T) 

which  Is  exactly  the  same  as  the  population  rldf  In  (5.1)!  Stated  another 
way,  a single  life  (or  component)  cannot,  on  the  average,  gather  any  addi- 
tional Information  about  his  remaining  lifetime  distribution  by  the  mere 
passage  of  time,  other  than  Chat  given  for  the  population  as  a whole — even 
though  he  can  learn  about  his  parameter!  A surprising,  but  satisfying 


result. 
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6.  COHORT  LEARNING  ABOUT  REMAINING  LIFE 

This  does  not  mean,  however,  that  several  Incomplete  samples  cannot 
provide  Information  about  other  lifetimes  with  the  same  6 , nor  that  a 
group  of  lives  with  the  same  6 cannot  learn  from  the  passage  of  time. 
Consider  a cohort  of  N lives  with  the  same  parameter  which  are  put  "on 
test"  at  the  same  epoch.  From  Section  2,  with  - T for  all  1 , we  see 
that  the  data  V ■ {x^  ^ T (1  e C)  ; C}  changes  (5.5)  to: 


E {P‘^(u  I 0)} 
l\V 


(6.1) 


J*  p‘^(T  + u 1 e) 


H-C-1 

P®(T  1 9) 

n p(x^  1 0)p(0) 

p‘'(P) 


de  , 


where  learning  would  clearly  take  place. 
For  the  proportional-hazard  family. 


(6.2) 


PjCu  1 ®) 


g-e[Q(T+u)-Q(T)] 


If  the  prior  at  T > 0 Is  gamma  with  hyperparameters  and  Qq  , the 

posterior- to-data  density  of  9 at  T Is  gamma  with  hyperparameters 
Cq  + C and  + (H  - C)Q(T)  + ^ Q(x^)  , giving  finally  the  special  cohort- 
experienced  remalnlng-llfetlme  distribution  function: 


(6.3) 


Qq  + (N  - C)Q(T)  + I Q(x^) 

(N  - 1 - C)Q(T)  + I Q(x^)  + Q(T  + u) 


Cq+C 


It  Is  easy  to  see  how  learning  vanishes  when  N • 1 and  C ■ 0 . 
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7.  LIFE  CONTINGENCIES  AND  RESERVES 

To  apply  the  results  above,  consider  that  we  are  determining  the  net 
single  premium  for  a continuous  life  annuity  of  $ 1/year,  at  force  of  inter- 
est 6 , for  an  individual  aged  x . Given  0 , this  would  be  (we  omit  the 
usual  overbar  notation) : 


(7.1)  " /*”^'^x 


(u  I 9)du 


-6u-[Q(x+u)-Q(x)], 


Let  us  suppose  that  the  prior  on  9 is  gamma  with  hyperparameters 

at  the  moment  of  underwriting  (age  x) . The  population-average  annuity  fair 


premium  is  then: 


(7.2) 


where 


j(Cj,Q^)  - J + Q^(u)j  ^^du  , 


(7.3) 


Qj^(u)  - Q(x  + u)  - Q(x) 


is  the  prototype  cumulative  failure  function  for  the  remaining  life,  begin- 
ning at  age  x . 

Now,  suppose  we  have  Insured  a cohort  of  N lives  aged  x , all  of 
whom  have  the  same  parameter,  and  let  us  follow  the  cohort  for  t addi- 
tional years.  During  this  time  the  data  provided  by  the  C expirations  at 
additional  ages  {t^}  , together  with  f fact  that  N - C lives  are  still 
in  existence  at  age  x + t , would  update  the  hyperparameters  to: 

(7.4)  C2  - C.  + C ; Q2(t)  - Q + (N  - C)0  (t)  + I 0 (t  ) . 

leC 
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Although  it  is  too  late  to  change  the  premium,  this  additional  knowl- 
edge could  be  useful  in  adaptive  modification  of  the  reserves  on  the  N - C 
outstanding  policies;  for  a single-premitm  annuity  of  $l/year  still  out- 
standing at  age  x + t , the  correct  adaptive  reserve  would  be: 

(7.5)  ^V(a^)  - a^j.(C2.Q2(t))  . 

We  remind  the  reader  that  C2  and  Q2(t)  will  be  random  outcomes,  depend- 
ing upon  actual  cohort  experience  during  ages  (x,x  + t]  . Only  when  there 
is  a single  incomplete  life  under  observation  (C2  “ ; Q2(t)  ■ 

will  no  learning  take  place,  and  the  reserves  will  follow  the 
classic  result  for  an  average  member  of  the  mixed  population: 


(7.6) 


x+t 


■/ 


-6u  , 

e — du  - a (C 

P (t) 

X 


A similar  development  could  be  given  in  terms  of  the  net  single  pre- 
mium for  a life  assurance  of  $1,  at  force  of  interest  6 , payable  at  the 
Instant  of  death  of  an  individual  now  aged  x , 


(7.7) 


■/ 


-du  , 
e p^(u 


6)du 


The  appropriate  formulae  follow  from  the  previous  results  by  the  universal 
relation  A * 1 - 5a  . 

X X 

It  is  of  interest  to  follow  through  the  actual  stochastic  behavior  of 
a "learning  reserve"  of  the  type  (7.S).  First  of  all,  we  note  that  adap- 
tive annuity  reserves  do  not  decrease  as  quickly  as  the  corresponding 
e^^  , for  small  t and  C • 0 , which  can  be  seen  from: 
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(7.8) 


dt 

C q(x  + t) 

Q2(t) 

- (N  - C - l)a 


{(»  - C)a^^(Cj.Q2<t)) 
+ 1 , Q^Ct))) 


as  compared  with  the  well-known  classical  result 

S • 

x+t 

The  term  in  square  brackets  is,  of  course,  the  failure  rate  at  x+t  for 
the  mixed  population  In  the  proportional-hazard  family,  i.e. , the  deriva- 
tive of  (3.15).  When  the  first  and  subsequent  deaths  occur,  there  Is  an 
Instantaneous  drop  In  (7.5),  since  C2  Increases  by  unity,  but  Q2(t)  Is 
continuous.  In  general.  If  fewer  (more)  lives  than  expected  terminate 
during  (x,x  + t]  , the  reserves  on  the  remaining  lives  are  larger  (smaller) 
than  usual,  since  this  Indicates  that  the  value  of  6 Is  smaller  (larger) 
than  average  for  this  cohort.  A complementary  effect  occurs  for  life  assur- 
ance learning  reserves. 

It  should  be  mentioned  that  a gamma-mixed  proportional-hazard  model 
should  be  used  with  care  for  human  mortality.  If,  for  example,  the  proto- 
type failure  rate  Is  assumed  to  follow  Makeham's  law,  q(t)  ~ A + Be^^  , we 
find  that  the  mixed  hazard  rate  (the  derivative  of  (3.15))  Is  asjmptotlcally 
constant,  due  to  the  f allure-rate-decreasing  properties  of  mixing!  One 
would  have  to  assume  that,  given  6 , Individuals  follow  a much  stronger 
"wear-out"  (say,  Gumbel),  In  order  to  obtain  a population  Makeham-type  law. 

It  Is  Interesting  to  speculate  as  to  whether  or  not  this  occurs  for  closely- 
matched  humans,  where  6 would  have  to  Include  health,  genetic,  and  cnvlron- 


da 


(7.9) 


xft 


dt 


6a  , - 1 + 

x+t 


C2q(x  + t) 


mental  effects. 
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8.  DEFECTIVE  DISTRIBUTIONS 

Component  and  human  lives  are  finite,  with  probability  one;  however, 
defective  distributions  arise  In  a variety  of  other  operational  situations. 
Consider,  for  example,  the  estimation  of  the  time  It  takes  for  a number  of 
requests  for  bids,  mailed  survey  responses,  etc.,  to  be  returned.  Some 
responses  are  received  rather  quickly;  In  other  cases,  an  answer  Is  never 
received . 

A reasonable  model  for  this  situation  would  add  an  unknown  defeat 
parameter t , to  the  usual  lifetime  distribution,  as  follows: 


(8.1)  P‘=(x  1 e,*)  -♦+(!-  (^)P'^(X  1 6)  ; p(x  | 0,^)  - (1  - ♦)p(x  ] 0)  . 

Is  then  the  probability  that  the  lifetime  Is  "Infinite." 

Under  the  life  testing  scheme  of  Section  2,  the  likelihood  of  the  data 
set  P becomes : 

(8.2)  p(0  1 0,«^)  -{p)(l  - 4.)^  n p(x.  1 0)  n L + (1  - 1 0)1  . 

leC  ^ j^C  L J J 


Assuming  all  the  intervals 
to-data  density  of  0 and 


Tj  have  common  value  T , we  find  the  posterior- 
^ by  a binomial  expansion: 


p(0,(|)  I V)  - 

^ ^ K i (1  - ♦)**"^  n p(x  I 0)P‘^(T  I 0)”"^'^p(0,«^) 

J-0  V J / leC  ^ 

where  K Is  a normalizing  constant  to  make  p(0,^  | P)d0d(^  > 1 . To 
illustrate  the  calculations  further,  assume  that  the  "honest"  part  of 
(8.1),  P^(x  I 0)  , Is  from  the  proportional-hazard  family  (3.1),  with 
gamma  prior  on  0 (3.4).  For  simplicity,  assume  Is,  a priori.  Inde- 


pendent of  0 , and  has  a beta  prior  density: 
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. a -1/  \b  -1 

(8.4)  p(^)  - p((^  I a^.b^)  - B"^(a^,b^)4>  ° U - <f/  ° (0  ^ < 1)  . 

B(a  ,b  ) is  the  beta  function,  r(a  )r(b  )/r(a  + b ) . After  stralght- 
o o o o o o 

forward  calculations  with  these  special  forms,  we  find  the  mixed  beta- 


p(e,*  I V)  - 


(8.5)  N-C 

I n (P)p(<^  ! + J 

j-0  J ° 


b^  + N - j)p(0  I Cq  + C , Qq  + TQTj(l?))  , 


where 


(8.6)  TQT,(P)  - I Q(x  ) + (N  - C - j)Q(T)  , 

J leC  ^ 


and  the  mixing  probabilities  are  given  by: 

(■ 

(8.7)  nj(P)  - K • B(a^  + j , b^  + N - j)|  Qq  + TQrj(P)J 

where,  again,  K Is  a normalizing  factor  to  make  J 11^  ■ 1 . It  Is  Impor- 
tant to  note  that,  posterlor-to-data,  the  estimates  of  6 and  4)  are 
dependent,  unless  all  of  the  observations  are  complete.  For  estimating  the 
mean  defect,  we  have 

N“C 

(8.8)  E{^  I P}  - I n/P) 

j-0  J 

:dierc  we  recognize  the  term  In  square  brackets  as  the  mean  of  4)  , given 
only  that  we  observe  j defects  out  of  N trials.  For  N - 1 and  no 
failure : 


a + b + N 
o o 
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(8.9) 


E{*  1 P} 


a 

o 

1 + Q(T)/Qq 

a + b +1 

0 o 

a 

0 

1 + Q(T)/Qq 

o 

which  shows  clearly  how  the  mean  defect  Increases  from  its  original  esti- 
mate of  a / (a  + b ) towards  (a  + 1)  / (a  + b +1)  as  T •+■  “>  with  no 
0 o o o o o 

failure.  Of  course,  if  the  lifetime  ever  terminates,  E{(|)  | V]  jumps  to 
(b  + l)/(a  + b + 1)  . Other  mixing  models  are  given  in  [4]. 
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